10.2 The Calculus of Causation

117

a great stumbling block in the development of quantitative causal thinking that no

mathematical notation existed to capture the results of an intervention. 4 This was

provided by Judea Pearl’s do-calculus. 5

Thus,y vertical bar d o left parenthesis x right parenthesis comma z right parenthesisy|do(x), z) signifies the probability ofupper Y equals yY = y given thatupper XX is held constant

atxx andupper Z equals zZ = z is observed. Unlike the associational models of traditional statistical

analysis, causal models (sometimes called structural models) can be used to predict

how the probabilities of events would change as a result of external interventions,

whereas associational models assume that conditions remain the same. Pearl (2001)

has given an analogy to visual perception: a precise description of the shape of a

three-dimensional object is useful and sufficient for predicting how that object will

be viewed from any angle, but insufficient for predicting how the shape might change

if it is squeezed by external forces, which requires information about the material

from which the object is made and its Young’s, bulk, etc. moduli.

Pearl has given three rules of causal inference, 6 which allow sentences concerning

interventions to be transformed into others concerning observations only. The causal

model is directed as a cyclic graph upper GG and upper X comma upper Y comma upper Z and upper WX, Y, Z and W are disjoint subsets of

variables. The rules are:

Rule 1

(insertion/deletion of observations)

upper W 2 left parenthesis y 1 t 1 comma y 2 t 2 right parenthesis equals upper W 1 left parenthesis y 1 t 1 right parenthesis upper W 1 left parenthesis y 2 t 2 right parenthesis commaP(y|(x), z, w) = P(y|(x), w) if (YZ|X, W)G ¯X .

(10.1)

Rule 2

(action/observation exchange)

upper W 2 left parenthesis y 1 t 1 comma y 2 t 2 right parenthesis equals upper W 1 left parenthesis y 1 t 1 right parenthesis upper W 1 left parenthesis y 2 t 2 right parenthesis commaP(y|(x), do(z), w) = P(y|(x), z, w) if (YZ|X, W)G ¯X Z .

(10.2)

Rule 3

(insertion/deletion of actions)

upper W 2 left parenthesis y 1 t 1 comma y 2 t 2 right parenthesis equals upper W 1 left parenthesis y 1 t 1 right parenthesis upper W 1 left parenthesis y 2 t 2 right parenthesis commaP(y|(x), (z), w) = P(y|(x), w) if (YZ|X, W)G ¯X,Z(W) .

(10.3)

In words, Rule 1 states that if a variable upper WW irrelevant to upper YY is observed, then the

probability distribution of upper YY will not change provided variable set upper ZZ blocks all the

paths fromupper WW toupper YY after having deleted all paths leading toupper XX; Rule 2 states that if a

setupper ZZ of variables blocks all paths fromupper XX toupper YY, thenMathID27(x) is equivalent to observingxx

(conditional onupper ZZ); and Rule 3 states thatMathID30(x) can be removed fromMathID31P(y|(x) whenever

there is no causal path fromupper XX toupper YY, i.e.,MathID34P(y|(x) = P(y). Huang and Valtorta (2006)

have shown that these three rules are complete, in the sense that if a causal effect is

identifiable, a sequence of operations exists that transforms the causal effect formula

into one that only includes observational quantities.

4 For an account of how statistics was able to approach relative causal effects, see Reiter (2000).

5 Pearl (1994).

6 See also Pearl (2019).